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Abstract 

Dempster-Shafer theory is widely applied to uncertainty modelling and knowl- 
edge reasoning due to its ability of expressing uncertain information. How- 
ever, some conditions, such as exclusiveness hypothesis and completeness 
constraint, limit its development and application to a large extend. To over- 
come these shortcomings in Dempster-Shafer theory and enhance its capa- 
bility of representing uncertain information, a novel theory called D numbers 
theory is systematically proposed in this paper. Within the proposed theory, 
uncertain information is expressed by D numbers, reasoning and synthesiza- 
tion of information are implemented by D numbers combination rule. The 
proposed D numbers theory is an generalization of Dempster-Shafer theory, 
which inherits the advantage of Dempster-Shafer theory and strengthens its 
capability of uncertainty modelling. 
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1. Introduction 



Since first proposed by Dempster 



.11] and then developed by Shafer 2], 



Dempster-Shafer theory of evidence , also called Dempster-Shafer theory or 
evidence theory, has been paid much attentions for a long time and contin- 
ually attracted growing interests. This theory needs weaker conditions than 
the Bayesian theory of probability, so it is often regarded as an extension of 



the Bayesian theory 



3, 4, 5|, 6]. Many studies have been devoted to further 
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Even as a theory of reasoning under the uncertain environment, Dempster- 
Shafer theory has an advantage of directly expressing the “uncertainty” by 
assigning the probability to the subsets of the set composed of multiple ob- 
jects, rather than to each of the individual objects. However, it is also con- 
strained by many strong hypotheses and hard constraints which limit its 
development and application to a large extend. For one hand, the elements 
in a frame of discernment (FOD) are required to be mutually exclusive, ft is 
called exclusiveness hypothesis. For another, the sum of basic probabilities of 
a mass function must be equal to 1, which is called completeness constraint. 
In the following of this paper, we will show how these conditions limit the 
application of Dempster-Shafer theory. 

To overcome these shortcomings in Dempster-Shafer theory and strengthen 
its capability of representing uncertain information, a novel theory called D 



numbers theory is systematical 
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149] is used to model uncertain informa- 



tion. What’s more, a D numbers combination rule is proposed to synthesize 
all the information expressed by D numbers and implement the uncertainty 
and knowledge reasoning. Actually, D numbers and D numbers combination 
rule is an extension of mass function and Dempster’s rule of combination, 
respectively. If meeting certain conditions, they will degenerate to classi- 
cal mass function and Dempster’s rule of combination. Consequently, the 
proposed D numbers theory is an generalization of Dempster-Shafer theory. 

The rest of this paper is organized as follows. Section [2] gives a brief 



3 



introduction about the Dempster- Shafer theory. In Section [31 the proposed 
D numbers theory is presented, mainly including D numbers and D numbers 
combination rule. Some numerical examples are given to show the application 
of D numbers theory in Section [H Finally, conclusions are given in Section 

El 

2. Dempster-Shafer theory 

For completeness of the explanation, a few basic concepts about Dempster- 
Shafer theory are introduced as follows. 

For a finite nonempty set D = {H\, # 2 , • • • , Hn}, 0 is called a frame of 
discernment (FOD) when satisfying 

Hi ft Hj = 0, Vi,j = {1, - • • , N}. (1) 

Let 2 n be the set of all subsets of D, namely 

2 n = {A | A C D}. (2) 

2 n is called the power set of D. For a FOD D, a mass function is a mapping 
m from 2 n to [0, 1], formally defined by: 

m: 2 n — ► [0, 1] (3) 

which satisfies the following condition: 

m(0) = 0 and ^ m(A) = 1 (4) 

Ag 2 n 
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In Dempster-Shafer theory, a mass function is also called a basic probability 
assignment (BPA). Given a BPA, the belief function Bel : 2 n — > [0,1] is 
defined as 

Bel(A) = m ( B ) (5) 

BCA 

The plausibility function PI : 2 n — * [0, 1] is defined as 



Pl(A) = 1 - Bel(A) = J2 m ( B ) 

BrAytd) 



( 6 ) 



where A = — A. These functions Bel and PI express the lower bound and 

upper bound in which subset A has been supported, respectively. 

Given two independent BPAs mi and m 2 , Dempster’s rule of combination, 
denoted by m = ?T7-i©m 2 , is used to combine them and it is defined as follows 



m(A) = 



l—K 



E m 1 (B)m 2 (C) , A ^ 0; 



BnC=A 



(V 



0 , 



A = 



with 



K= ^2 m 1 (B)m2(C) (8) 

snc=0 

Note that the Dempster’s rule of combination is only applicable to such two 
BPAs which satisfy the condition K < 1. 



3. D numbers theory 

In the mathematical framework of Dempster-Shafer theory, there are sev- 
eral strong hypotheses and constraints on the FOD and BPA. However, these 
hypotheses and constraints limit the ability of Dempster-Shafer theory to 
represent uncertain information. 
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First, a FOD must be a mutually exclusive and collectively exhaustive 
set, the elements of FOD are required to be mutually exclusive, as shown in 
Eq.(H]). In many situations, however, it is very difficult to be satisfied. Take 
assessment as an example. In evaluating one object, it often uses linguistic 
variables to express the assessment result, such as “Very Good”, “Good”, 
“Fair”, “Bad” and “Very Bad”. Due to given by human, it inevitably ex- 
ists intersections among these linguistic variables. Therefore, the exclusive- 
ness hypothesis cannot be guaranteed precisely so that the application of 
Dempster-Shafer theory is questionable for such situations. There are al- 
ready some studies about FOD with non-exclusive hypotheses 54, 

Second, the sum of basic probabilities of a normal BPA must be equal to 
1, as shown in Eq. (1Tj) . We call it as completeness constraint. But in some 
cases, due to lack of knowledge and information, it is possible to obtain an 
incomplete BPA whose sum of basic probabilities is less than 1. For example, 
if an assessment is based on little partial information, the lack of information 
may resu 



150], 



t in a complete BPA cannot be obtained. Furthermore, in an open 
world 52], the incompleteness of FOD may also lead to the incompleteness 
of BPA. Hence the completeness constraint is hard to completely meet in 
some cases and it restricts the application of Dempster-Shafer theory. 

To overcome these existing shortcomings in Dempster-Shafer theory and 
enhance its capability of expressing uncertain information, a novel theory, 
named as D numbers theory, is systematically proposed. D numbers theory 
looses FOD’s exclusiveness hypothesis and BPA’s completeness constraint, 



which is a generalization of Dempster-Shafer theory. 
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Definition 1. Let 0 be a nonempty set 0 = { F\ , F 2 Fn} satisfying 

F{ ^ Fj if i ^ j, Vi, j = {1, • • • , iV} , D numbers is a mapping formulated by 

D : 2 e ->■ [0, 1] (9) 

with 

D(B) < 1 and D(0) = 0 (10) 

see 

where 0 is the empty set and B is a subset of 0. 

ft is found that the definition of D numbers is similar with the definition of 

BPA. But note that, at first, differ from the definition of FOD in Dempster- 

Shafer theory, the exclusiveness hypothesis is removed, i.e., the elements 

in set 0 don’t require mutually exclusive in D numbers. At second, the 

completeness constraint is released in D numbers. If ^ D(B) = 1, the 

Bee 

information is said to be complete; if D(B) < 1, the information is said 

Bee 

to be incomplete. The degree of information’s completeness is defined as 
below. 

Definition 2. Let fibeaD number on a finite nonempty set 0, the degree 
of information’s completeness in D is quantified by 

Q=Y,D(B) (11) 

see 

In Dempster-Shafer theory, Dempster’s rule of combination has played a 
central role to synthesize all the knowledge of the initial BPAs. Correspond- 
ingly, in D numbers theory which is treat as a generalization of Dempster- 
Shafer theory, a D numbers combination rule is proposed to combine the 
information indicated by D numbers. 
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Definition 3. Let D 1 and D 2 be two D numbers, the combination of L)\ 



and D- 2 , indicated by D = D\ © D 2 , is defined by 

D(0) = 0 

b ( b ) = iTf. E _ B# 



SiD52=-B 



with 



A'/) = 



X] D 1 (B 1 )D 2 (B 2 



QtQ 2 s in s 2 =0 
Qi = 

Bice 

Q 2 =Y, D ^ B2 

B2QO 



( 12 ) 



(13) 

(14) 

(15) 



The proposed D numbers combination rule is a generalization of Demp- 
ster’s rule of combination. If Di, D 2 are defined on a FOD and Q± = 1, 
Q 2 — 1, the D numbers combination rule will degenerate to the Dempster’s 
rule of combination. This combination rule provides a practical and feasible 
scheme to synthesize the uncertain information modeled by D numbers. 

So far, D numbers theory has been proposed. In summary, it includes two 
main aspects. On the one hand, with respect to representation of uncertain 
information, D numbers provide a useful model. On the other hand, with 
respect to knowledge and uncertainty reasoning, D numbers combination rule 
can be employed to synthesize uncertain information. 



4. Numerical Examples 

In this section, some numerical examples are given to show the applica- 
tions of proposed D numbers theory. 




Example 1 . Assume a local government plans to build a hydropower sta- 
tion nearby a river. Before to implement this project, environmental impact 
assessment (EIA) is carried out, which is to identify and assess the conse- 
quences or potential impacts of human activities to the environment. Two 
groups of experts are employed to execute the task, independently. Assume 
the evaluation result is expressed by linguistic variables High , Medium and 
Low. One group evaluates that the damage of this project to the environment 
is High. The other group’s is Medium. 

If these results are modeled by using Dempster-Shafer theory, two BPAs 
can be obtained that mi(High) = 1, m^Medium) = 1. The Dempster’s 
rule of combination is then used to combine the evaluations given by these 
two groups, ffowever, due to m± and m 2 are completely conflicting, i.e. , 
K = 1, the Dempster’s rule of combination is unable to handle this situation. 
Actually, in Dempster-Shafer theory there is a hypothesis that High and 
Medium are mutually exclusive, i.e., High D Medium = 0, as shown in 
Figure [TJ 




Figure 1: The linguistic variables of High and Medium, in Dempster-Shafer theory 

But in the real situation, it inevitably exists intersections among linguistic 
variables given by human beings. D numbers theory abandons the exclusive- 



9 



ness hypothesis that elements must be mutually exclusive, as shown in Figure 
El In D numbers theory, these evaluation results can be indicated by two D 
numbers that Di(High) = 1, ^(Medium) = 1. The combination of D\ and 
D 2 is expressed by D(Highn Medium) = 1. Therefore, D numbers theory is 
more reasonable and capable to model the imprecise, ambiguous, and vague 
information. 




Figure 2: The linguistic variables of High and Medium in D numbers theory 



Example 2. Medical diagnosis is a typical field that involves various types 
of uncertainties. Assume a patient is with the symptoms of fever, polypnea, 
cough. According to previous cases, it is likely caused by flu (F), Bacterial or 
fungal pneumonia (B), and upper respiratory infection (U). Two independent 
diagnostic reports are submitted by two doctors. One doctor diagnoses that 
the patient got F with a possibility of 0. 1, and got B or U with a possibility 
of 0.2, the reminder 0.1 is unknown. The other doctor’s diagnostic report 
shows that it is 0.5 sure that the patient got F and 0.3 sure that the patient 
got B, the reminder 0.2 is also unknown. The problem is what disease the 
patient got. 

Let’s consider this problem in the framework of Dempster-Shafer theory 
first. According to these two diagnostic reports, two BPAs can be obtained 
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mi(F) = 0.7, mi(B, U ) = 0.2, rrii(F, B, U ) = 0.1; 
m 2 (F) = 0.5, m 2 (B) = 0.3, m 2 (F, B , U ) = 0.2. 



Table 1: Intersection table to combine TOi and m 2 



mi © m 2 


m 2 (F) = 0.5 


m 2 (B) = 0.3 


mi(F, B , U) = 0.2 


mi(F) = 0.7 


{F}(0.35) 


0(0.21) 


(F}(0.14) 


mi(B, U ) = 0.2 


0(0.10) 


{H}(0.06) 


{B,U}(0M) 


rri\ (F, B, U ) = 0.1 


{F}(0.05) 


{B}( 0.03) 


{FB,U}( 0.02) 



The intersection table of m\ © m 2 is shown in Table [D Then, we can 
obtain that 

K = mi(B, U)m 2 (F ) + rrii(F)m 2 (B) = 0.31; 

m(F) = (mi(F)m 2 (F) + m 1 (F, B, U)m 2 (F) + rrii(F)m 2 (F, B, U)) = 

0.7826; 

m{B) = (■ mi(B , U)m 2 (B ) + mi(F, B, U)m 2 (B)) = 0.1304; 

m(B, U ) = Y^jrmi(B, U)m 2 (F , B, U ) = 0.0580; 

m(F , B, U ) = j^mi(F, B, U)m 2 (F, B , [/) = 0.0290; 

In the above calculating process, there is an invisible hypothesis that the 
possibility of unknown is equal to that of {F, B , U}. In other words, the set 
of all diseases causing the symptoms of fever, polypnea and cough, is seen as 
equivalent to the set {F,B,U} which only contains three types of diseases. 
However, it can not be obtained according to the doctors’ diagnostic reports. 
This invisible hypothesis obviously is not reasonable. In the real world, there 
may be other reasons resulting in these symptoms, but they are unknown due 
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to the limitation of human beings’ current knowledge and cognitive level. For 
example, until 2003, SARS (Severe Acute Respiratory Syndrome) is found 
and then the knowledge about disease whose symptoms are fever, polypnea 
and cough is updated. 

Maybe there are some debate about the above discussion. Someone would 
argue that a set X which includes all unknown factors can be imported in 
the construction of BPAs. For example, the first BPA can be constructed 
as rrii(F) = 0.7, mi(B,U) = 0.2, mi(F, B, U, X) = 0.1. By this means, 
the invisible hypothesis is removed. However, the situation is not as good 
as thought. At first, the complexity of this problem has greatly increased if 
introduce X. At second, actually the invisible hypothesis is almost ignored 
by people in the applications of Dempster-Shafer theory in order to reduce 
the complexity. At third, the proposed D numbers theory is congenitally able 
to well handle the situation of information incompleteness. In the following, 
we will investigate this example using D numbers theory. 

Now let us consider this problem by using D numbers theory. Accord- 
ing to the two pieces of information given by the diagnostic reports, two D 
numbers can be derived that 
Di(F) — 0.7, £>!(£, £7) = 0.2; 

D 2 {F) = 0.5, D 2 (B) = 0.3. 

It is noted that the unknown information is not assigned to any set. The 
constructed two D numbers are in the forms of information incompleteness. 
Q 1 = D 1 (F) + D 1 (B,U) = 0.9; 

Q 2 — D-ziF) + Dz^B) = 0 . 8 . 
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Table 2: Intersection table to combine D w and D s 



D\ 0 Do 


D 2 (F) = 0.5 


D 2 {B) = 0.3 


D t (F) = 0.7 


{F}(0.35) 


0(0.21) 


D^B, U) = 0.2 


0(0.10) 


{A?} (0.06) 



The intersection table of DiQD 2 is shown in Table [2] According to Table 
[21 we can calculate that 

K d = (D^B, U)D 2 (F) + Di(F)D 2 (B)) = 0.4306; 

D(F) = yt^Di(F)D 2 (F) = 0.6147; 

D(B) = U)D 2 {B) = 0.1054. 

with Q = D(F) + D(B ) = 0.72. 

The result also shows the flu is the most probable disease the patient 
got. By comparison with Dempster-Shafer theory, however, in the proposed 
D numbers theory the unknown is inherited during the reasoning. D num- 
bers theory has inherent advantage to handle the situation of information 
incompleteness, which is more natural and reasonable. 

Example 3. Pattern recognition is a key technology in machine learning and 
many other fields. Supposing there is an object X which certainly belongs to 
one of three classes indicated by {A, B, C}. The weight sensor reports that 
X belongs to class A with a certainty of 0.6, and belongs to class C with a 
certainty of 0.4. The shape sensor reports that it is 0.7 sure that X belongs 
to classes A, B and the reminder 0.3 is with completely ignorance. 

Dempster-Shafer theory can be used in this case. Due to the object 
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certainly belongs to one of these three classes, the FOD can be determined 
that = {A, B,C}. According to the reports of weight sensor and shape 
sensor, two BPAs are obtained that: 
m. w {A ) = 0.6, m w (C) = 0.4; 
m s (A , B) = 0.7, m s (A, B , C) = 0.3. 



Table 3: Intersection table to combine m w and m s 



m w © m s 


m s (A, B ) = 0.7 


m s (A, B, C ) = 0.3 


m w (A) = 0.6 


M}(0.42) 


MH0.18) 


m w (C ) = 0.4 


0(0.28) 


{C}(0.12) 



The intersection table of m w © m s is shown in Table |3j Then, we can 
obtain that 

K = m w (C)m s (A, B ) = 0.28; 

m(A) = (m w (A)m s (A, B) + m w (A)m s (A, B , C)) = 0.8333; 

m(C) = y -^m w (C)m s (A, B,C) = 0.1667. 

Namely, the object X is with 0.8333 certainty belonging to class A, and 
with 0.1667 certainty belonging to class C by using Dempster-Shafer theory 
to combine the reports of weight sensor and shape sensor. 

Now let’s consider this problem using D numbers theory. At first, two D 
numbers can be derived to express the reports of weight sensor and shape 
sensor, respectively. 

D W (A) = 0.6, D W (C) = 0.4; 

D S (A, B ) = 0.7, D S (A, B, C ) = 0.3. 
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Table 4: Intersection table to combine D w and D s 



D w 0 D s 


D a {A,B) = 0.7 


D S (A, B, C) = 0.3 


D W (A ) = 0.6 


{-41(0.42) 


{-41(0.18) 


D W (C) = 0.4 


0(0.28) 


{C}(0.12) 



Note that due to X belongs to one of {A, B , C} with 100% certainty, for 
the shape sensor’s report, the remaining possibility 0.3 can be assigned to 
set {A,B,C}, namely D S (A, B,C) = 0.3. It is reasonable. The intersection 
table of D w © D s is shown in Table [4] So, we can calculate that 

Qw — D W (A) + D W (C) = 1.0; 

Qs — D S (A, B ) + -D S (A, B , C) = 1.0; 

K d = q^D w {C)D s (A,B) = 0.28; 

D(A) = x ^ Kd ( D w (A)D s (A , B) + D W (A)D S (A , B , C )) = 0.8333; 

D (°) = 1 ^D w (C)D s (A, B, C ) = 0.1667. 

The result derived from D numbers theory is identical with that of Dempster- 
Shafer theory. In this example, the set {A, B,C} is mutually exclusive and 
collectively exhaustive for this problem, and the two pieces of information are 
complete, therefore, both Dempster-Shafer theory and D numbers theory can 
handle this case, and D numbers theory has degenerated to Dempster-Shafer 
theory. 
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5. Conclusions 



In this paper, a novel theory called D numbers theory is systematically 
proposed. The proposed D numbers theory is an generalization of Dempster- 
Shafer theory, which releases the FOD’s exclusiveness hypothesis and BPA’s 
completeness constraint in Dempster-Shafer theory. In the D numbers the- 
ory, D number is an extension of BPA, D numbers combination rule is an 
extension of Dempster’s rule of combination. Some numerical examples have 
been given to show the application of the proposed theory. In the future 
research direction, one the one hand, the properties of D numbers theory 
will be further studied. On the other hand, this theory will be applied to 
many real applications. 
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